where λ is the eigenvalue corresponding to the eigenvector x.
Given an eigenvalue λ, we can rearrange the above equation and solve for its corresponding eigenvector as a solution to the homogeneous system:
$$Ax - \lambda x = 0 \quad\Longrightarrow\quad (A - \lambda I)x = 0$$
Eigenspace
The set of all solutions x to (A−λiI)x=0 is called the eigenspace of A corresponding to an eigenvalue λi, which includes 0 and all the eigenvectors x corresponding to λi.
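For a small concrete illustration (a 2×2 matrix chosen here just as an example):

$$A = \begin{bmatrix} 2 & 1 \\ 0 & 3 \end{bmatrix}, \qquad (A - 2I)x = \begin{bmatrix} 0 & 1 \\ 0 & 1 \end{bmatrix}x = 0 \;\Longrightarrow\; x = x_1\begin{bmatrix} 1 \\ 0 \end{bmatrix}$$

so the eigenspace for λ=2 is Span{(1, 0)}, and indeed A(1, 0)⊤ = (2, 0)⊤ = 2(1, 0)⊤.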
Theorem 1
The eigenvalues of a triangular matrix are the entries on its main diagonal.
Theorem 2
If v1,…,vr are eigenvectors that correspond to distinct eigenvalues λ1,…,λr of an n×n matrix A, then the set {v1,…,vr} is linearly independent.
Intuitively, this makes sense because we can’t have two different stretching factors λ1 and λ2 along the same eigenvector x.
Questions
Is λ=3 an eigenvalue of $A = \begin{bmatrix} 1 & 2 & 2 \\ 3 & -2 & 1 \\ 0 & 1 & 1 \end{bmatrix}$? If so, find one corresponding eigenvector.
Starting with the definition of an eigenvector:
Ax=λx
We rearrange the equation
(A−λI)x=0
Now, λ=3 is an eigenvalue only if the following system has a non-zero solution x:
(A−3I)x=0
We find a solution by row-reducing the matrix A−3I
$$A - 3I = \begin{bmatrix} -2 & 2 & 2 \\ 3 & -5 & 1 \\ 0 & 1 & -2 \end{bmatrix} \sim \begin{bmatrix} 1 & 0 & -3 \\ 0 & 1 & -2 \\ 0 & 0 & 0 \end{bmatrix}$$
The reduced system has a free variable x3, so there are nontrivial solutions with x1=3x3 and x2=2x3. Thus, 3 is an eigenvalue of A, and one corresponding eigenvector is x = (3, 2, 1).
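As a quick numerical cross-check, here is a NumPy sketch (assuming the matrix entries as written above):

```python
import numpy as np

A = np.array([[1,  2, 2],
              [3, -2, 1],
              [0,  1, 1]])

# lambda = 3 should appear among the eigenvalues of A
print(np.linalg.eigvals(A))

# verify that (3, 2, 1) is an eigenvector for lambda = 3
v = np.array([3, 2, 1])
print(A @ v, 3 * v)  # both print [9 6 3]
```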
Find a basis for the eigenspace corresponding to λ=−2, 5 of the matrix $A = \begin{bmatrix} 1 & 4 \\ 3 & 2 \end{bmatrix}$
The eigenspace is the set of all solutions x to the homogeneous equation (A−λiI)x=0. Using the given eigenvalues:
$$(A - (-2)I)x = 0 \;\Longrightarrow\; \begin{bmatrix} 3 & 4 \\ 3 & 4 \end{bmatrix}x = 0 \;\Longrightarrow\; x = t\begin{bmatrix} -4 \\ 3 \end{bmatrix}$$
So the eigenspace corresponding to λ=−2 is all vectors along (−4, 3)⊤, and a basis for it is {(−4, 3)⊤}.
Similarly, for λ=5
$$(A - 5I)x = 0 \;\Longrightarrow\; \begin{bmatrix} -4 & 4 \\ 3 & -3 \end{bmatrix}x = 0 \;\Longrightarrow\; x = t\begin{bmatrix} 1 \\ 1 \end{bmatrix}$$
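As a quick check that these really are eigenvectors of A:

$$A\begin{bmatrix} -4 \\ 3 \end{bmatrix} = \begin{bmatrix} 8 \\ -6 \end{bmatrix} = -2\begin{bmatrix} -4 \\ 3 \end{bmatrix}, \qquad A\begin{bmatrix} 1 \\ 1 \end{bmatrix} = \begin{bmatrix} 5 \\ 5 \end{bmatrix} = 5\begin{bmatrix} 1 \\ 1 \end{bmatrix}$$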
True or False: To find the eigenvalues of A, reduce A to echelon form.
False. Row operations change the characteristic polynomial, thus changing the eigenvalues. We find eigenvalues by solving for the roots of the characteristic polynomial det(A−λI)=0
5.2 The Characteristic Equation
The eigenvalues of matrix A are all scalars λi such that the homogeneous system
(A−λI)x=0
has a nontrivial solution x≠0. This problem is equivalent to finding all λi such that the matrix A−λI is not invertible (by the Invertible Matrix Theorem, the homogeneous system has a nontrivial solution only when the matrix is not invertible). We know that a matrix is not invertible iff:
det(A−λI)=0
The above equation is called the characteristic equation. The algebraic multiplicity of an eigenvalue λi is its multiplicity as a root of the characteristic equation.
Theorem 4
If n×n matrices A and B are similar, then they have the same characteristic polynomial and hence the same eigenvalues with the same multiplicities.
Note: Similarity is not the same as row equivalence.
Questions
Find the characteristic polynomial and eigenvalues of $A = \begin{bmatrix} 7 & -2 \\ 2 & 3 \end{bmatrix}$
$$\det(A - \lambda I) = \begin{vmatrix} 7-\lambda & -2 \\ 2 & 3-\lambda \end{vmatrix} = (7-\lambda)(3-\lambda) + 4 = 0$$
Given the above characteristic polynomial, we solve for eigenvalues by finding the roots.
$$\lambda^{2} - 10\lambda + 25 = 0 \quad\Longrightarrow\quad (\lambda - 5)^{2} = 0$$
We find that A has the eigenvalue λ=5 with algebraic multiplicity (exponent) 2.
List the eigenvalues of A with their multiplicities.
$$A = \begin{bmatrix} 5 & 0 & 0 & 0 \\ 8 & -4 & 0 & 0 \\ 0 & 7 & 1 & 0 \\ 1 & -5 & 2 & 1 \end{bmatrix}$$
By Theorem 1, we know that the eigenvalues of a triangular matrix are the entries on its diagonal. Thus, λ=1,1,−4,5.
Find h in the matrix A such that the eigenspace of λ=5 is 2-dimensional.
$$A = \begin{bmatrix} 5 & -2 & 6 & -1 \\ 0 & 3 & h & 0 \\ 0 & 0 & 5 & 4 \\ 0 & 0 & 0 & 1 \end{bmatrix}$$
The eigenspace of an eigenvalue is the set of all solutions x to (A−λiI)x=0.
$$\begin{bmatrix} 0 & -2 & 6 & -1 \\ 0 & -2 & h & 0 \\ 0 & 0 & 0 & 4 \\ 0 & 0 & 0 & -4 \end{bmatrix}\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix} = 0$$
$$\sim \begin{bmatrix} 0 & 1 & -3 & 0 \\ 0 & 0 & h-6 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \end{bmatrix}\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix} = 0$$
If h=6, then this homogeneous system has 2 free variables (x1 and x3), making the eigenspace 2-dimensional.
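Explicitly, back-substituting with h=6 gives x2=3x3 and x4=0, so one possible basis for the eigenspace is:

$$\left\{\begin{bmatrix} 1 \\ 0 \\ 0 \\ 0 \end{bmatrix},\ \begin{bmatrix} 0 \\ 3 \\ 1 \\ 0 \end{bmatrix}\right\}$$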
Use a property of determinants to show that A and A⊤ have the same characteristic polynomial.
Let B=A−λI. A determinant is unchanged by transposition, det B = det B⊤, and B⊤ = (A−λI)⊤ = A⊤−λI, so:
$$\det(A - \lambda I) = \det\big((A - \lambda I)^{\top}\big) = \det(A^{\top} - \lambda I)$$
5.3 Diagonalization
Theorem 5: The Diagonalization Theorem
An n×n matrix A is diagonalizable iff A has n linearly independent eigenvectors.
A=PDP−1
where the n linearly independent eigenvectors form the columns of P, and D is a diagonal matrix whose diagonal entries are the corresponding eigenvalues of A, in the same order as the columns of P.
Example: Diagonalize $A = \begin{bmatrix} 1 & 3 & 3 \\ -3 & -5 & -3 \\ 3 & 3 & 1 \end{bmatrix}$
First, we find the eigenvalues of A as the roots of the characteristic polynomial:
$$0 = \det(A - \lambda I) = -\lambda^{3} - 3\lambda^{2} + 4 = -(\lambda - 1)(\lambda + 2)^{2} \;\Longrightarrow\; \lambda = 1,\ \lambda = -2$$
Then we find the eigenvectors corresponding to the two eigenvalues by plugging each one into the homogeneous equation (A−λI)x=0; the nonzero solutions x of this system are the eigenvectors corresponding to that eigenvalue. By the Diagonalization Theorem, there must be n linearly independent eigenvectors for the matrix to be diagonalizable.
$$\lambda = 1:\ \begin{bmatrix} 1 \\ -1 \\ 1 \end{bmatrix} \qquad \lambda = -2:\ \begin{bmatrix} -1 \\ 1 \\ 0 \end{bmatrix},\ \begin{bmatrix} -1 \\ 0 \\ 1 \end{bmatrix}$$
Now we construct P and D:
$$P = \begin{bmatrix} 1 & -1 & -1 \\ -1 & 1 & 0 \\ 1 & 0 & 1 \end{bmatrix} \qquad D = \begin{bmatrix} 1 & 0 & 0 \\ 0 & -2 & 0 \\ 0 & 0 & -2 \end{bmatrix}$$
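A quick numerical sanity check that A = PDP−1 holds (a NumPy sketch, using the matrices as written above):

```python
import numpy as np

A = np.array([[ 1,  3,  3],
              [-3, -5, -3],
              [ 3,  3,  1]])
P = np.array([[ 1, -1, -1],
              [-1,  1,  0],
              [ 1,  0,  1]])
D = np.diag([1, -2, -2])

# P D P^{-1} should reproduce A (up to floating-point error)
print(np.allclose(P @ D @ np.linalg.inv(P), A))  # True
```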
Theorem 6
An n×n matrix with n distinct eigenvalues is diagonalizable, because distinct eigenvalues guarantee n linearly independent eigenvectors (Theorem 2). This is a sufficient condition for a matrix to be diagonalizable, but not a necessary one: a diagonalizable matrix need not have n distinct eigenvalues.
Questions
Diagonalize $A = \begin{bmatrix} 2 & 3 \\ 4 & 1 \end{bmatrix}$
We find a diagonal matrix D whose entries are the two eigenvalues, and a matrix P whose 2 columns are the 2 corresponding linearly independent eigenvectors.
We find the eigenvalues as roots of the characteristic polynomial:
$$\det(A - \lambda I) = (2-\lambda)(1-\lambda) - 12 = \lambda^{2} - 3\lambda - 10 = (\lambda - 5)(\lambda + 2) = 0 \;\Longrightarrow\; \lambda = 5,\ -2$$
The matrix A has 2 distinct eigenvalues, which by Theorem 6 guarantees 2 linearly independent eigenvectors, and hence that the matrix is diagonalizable by the Diagonalization Theorem.
$$D = \begin{bmatrix} 5 & 0 \\ 0 & -2 \end{bmatrix}$$
We find the two eigenvectors as solutions to the homogeneous equations of their corresponding eigenvalues
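One possible choice of eigenvectors, solving each homogeneous system with A as written above (any nonzero scalar multiples work just as well):

$$\lambda = 5:\ \begin{bmatrix} -3 & 3 \\ 4 & -4 \end{bmatrix}x = 0 \Rightarrow v_1 = \begin{bmatrix} 1 \\ 1 \end{bmatrix} \qquad \lambda = -2:\ \begin{bmatrix} 4 & 3 \\ 4 & 3 \end{bmatrix}x = 0 \Rightarrow v_2 = \begin{bmatrix} -3 \\ 4 \end{bmatrix}$$

$$P = \begin{bmatrix} 1 & -3 \\ 1 & 4 \end{bmatrix}$$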
We’re given the eigenvalues but not their multiplicities, which is a bit annoying, but it’s ok: we just find the eigenvectors as solutions to the homogeneous system (A−λI)x=0 for each eigenvalue.
We have 3 linearly independent eigenvectors, thus the matrix is diagonalizable by the Diagonalization Theorem:
$$P = \begin{bmatrix} -2 & -3 & -2 \\ 1 & 0 & -1 \\ 0 & 1 & 1 \end{bmatrix} \qquad D = \begin{bmatrix} 2 & 0 & 0 \\ 0 & 2 & 0 \\ 0 & 0 & 1 \end{bmatrix}$$
True or False: If A is diagonalizable, then A is invertible.
False. A diagonalizable matrix may have λ=0 as an eigenvalue, which would make it non-invertible.
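For example, $A = \begin{bmatrix} 1 & 0 \\ 0 & 0 \end{bmatrix}$ is already diagonal (and hence trivially diagonalizable), but it has eigenvalue 0 and determinant 0, so it is not invertible. Diagonalizability and invertibility are independent properties.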
5.4 Eigenvectors and Linear Transformations
Theorem 8: Diagonal Matrix Representation
Suppose A is diagonalizable as A=PDP−1. The columns of P form a basis for Rn; thus D is the P-coordinate matrix [T]P for the transformation x↦Ax.
Informally, this theorem says that a matrix diagonalization expresses the matrix as a change of coordinates into the eigenbasis, a scaling along the axes (eigenvectors), and a change back into the original coordinate space, where D expresses the transformation T in the eigenbasis P.
Thus, we can generalize T(x)=Ax to a non-standard basis B:
[T(x)]B=[T]B[x]B
Proof
We have the eigenbasis P=[b1…bn]. We can show that the matrix D is the transformation T in the eigenbasis P:
[T]P=D
First, we remember that, in the standard basis, the columns ai of a linear transformation matrix A are defined as T(ei):
A=[T(e1)…T(en)]
We can then extend this definition beyond the standard basis, to the eigenbasis P:
[T]P=[[T(b1)]P…[T(bn)]P]
Then, again using the definition T(x)=Ax:
[T]P=[[Ab1]P…[Abn]P]
The change-of-coordinates matrix from the basis P to the standard basis E is just P, so the change-of-coordinates matrix from E back to P is P−1:
[T]P=[P−1Ab1…P−1Abn]
Factoring P−1A out of every column:
[T]P=P−1A[b1…bn]
[T]P=P−1AP=D
The Matrix of a Linear Transformation
Given the coordinates [x]B of a vector x relative to the basis B, we can find the coordinates [T(x)]B of the transformed vector T(x) as:
[T(x)]B=A[x]B
where the columns of A are the B-coordinate vectors of the transformed basis vectors:
A=[[T(b1)]B…[T(bn)]B]
This is just a generalization of A=[T(e1)…T(en)] in the standard basis. Notice that A is not a change-of-coordinates matrix because both sides are coefficients of B-basis vectors.
Note: The set of all matrices similar to matrix A is equivalent to the set of all matrix representations of the transformation x↦Ax.
Questions
Let T:P2→P2 be defined by T(p)=p(0)−p(1)t+p(2)t2.
a. Show that T is a linear transformation
First, let’s understand the problem. This linear transformation takes in a polynomial in the parameter t, for example p(t)=t. Then, we can plug this polynomial into the transformation: T(p)=p(0)−p(1)t+p(2)t2=0−(1)t+(2)t2=−t+2t2.
We find that p(t)=−2+t is an eigenvector of this linear transformation with an eigenvalue of 1.
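Checking this directly from the definition of T:

$$p(t) = -2 + t:\quad p(0) = -2,\ p(1) = -1,\ p(2) = 0 \;\Longrightarrow\; T(p) = -2 - (-1)t + 0\cdot t^{2} = -2 + t = 1\cdot p(t)$$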
c. Find the matrix for T relative to the basis {1,t,t2} for P2
We know that the transformation matrix of a linear transformation is given by the transformed standard basis vectors:
M=[T(e1)…T(en)]
Thus, using the standard basis vectors in P2:
$$\begin{bmatrix} 1 \\ 0 \\ 0 \end{bmatrix} = 1, \quad \begin{bmatrix} 0 \\ 1 \\ 0 \end{bmatrix} = t, \quad \begin{bmatrix} 0 \\ 0 \\ 1 \end{bmatrix} = t^{2}$$
$$T(1) = 1 - t + t^{2} \qquad T(t) = 0 - t + 2t^{2} \qquad T(t^{2}) = 0 - t + 4t^{2}$$
$$M = \begin{bmatrix} 1 & 0 & 0 \\ -1 & -1 & -1 \\ 1 & 2 & 4 \end{bmatrix}$$
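As a quick consistency check, applying M to the coordinate vector of p(t)=−2+t (the eigenvector found above):

$$M\begin{bmatrix} -2 \\ 1 \\ 0 \end{bmatrix} = \begin{bmatrix} -2 \\ 1 \\ 0 \end{bmatrix}$$

which again shows the eigenvalue 1.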
Let B={b1,b2,b3} be a basis for a vector space V. Find T(2b1−b2+4b3) when T is a linear transformation from V to V whose matrix relative to B is
$$[T]_B = \begin{bmatrix} 0 & -6 & 1 \\ 0 & 5 & -1 \\ 1 & -2 & 7 \end{bmatrix}$$
As described in the section above, [T]B is just the generalization of the transformation matrix A to the non-standard basis B, such that [T(x)]B=[T]B[x]B.
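Using [x]B = (2, −1, 4) and the matrix as written above:

$$[T(x)]_B = \begin{bmatrix} 0 & -6 & 1 \\ 0 & 5 & -1 \\ 1 & -2 & 7 \end{bmatrix}\begin{bmatrix} 2 \\ -1 \\ 4 \end{bmatrix} = \begin{bmatrix} 10 \\ -9 \\ 32 \end{bmatrix}$$

so T(2b1−b2+4b3) = 10b1 − 9b2 + 32b3.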